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Abstract 

Using detrended fluctuation analysis (DFA), we study the scaling properties of the 
volatility time series Vi = |Tj_|_i — Ti\ of daily temperatures Tj for ten chosen sites 
around the globe. We find that the volatility is long range power-law correlated with 
an exponent 7 close to 0.8 for all sites considered here. We use this result to test 
the scaling performance of several state-of-the art global climate models and find 
that the models do not reproduce the observed scaling behavior. 
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1. Introduction 



Indications of weather persistence over months and seasons are known [1]. This 
long term persistence in the atmospheric temperature variability is analysed 
and quantified by detrended fluctuation analysis (DFA) and wavelet transform 
techniques with a fluctuation exponent of a ~ 0.65, which is independent 
of the location of the site [2]. Also, indications of persistence are confirmed 
through power spectral analysis [3,4]. Presence of such a universal persistence 
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law indicates that the processes governing the atmospheric dynamics at dif- 
ferent chmatological zones are based on similar principles [2]. Here, we are 
not interested in the temperature fluctuations around the seasonal trend, but 
in the magnitude of temperature changes between successive days, i.e the 
temperature volatility. Volatility is the concept often used in econophysics to 
indicate the fluctTiations in price changes [5]. The market is said to be more 
volatile if the fluctuations in price changes are high [6]. In recent years this 
concept has been applied to cardiac system and found that cardiac volatility 
series is long range correlated [7]. 

Here we study the volatility of atmospheric temperature data obtained from 
10 randomly chosen meteorological stations in Europe, North America, Asia, 
Russia and Australia, from various chmatological zones. Correlations in the 
volatihty series give information about persistence in the changes. If, for ex- 
ample, the change in the temperature between two successive days is small 
there is a high tendency that the change remains similar for the next consecu- 
tive days. Here, we study long-term temperature records (typically 100 years). 
We use detrended fluctuation analysis (DFA) up to the order of flve [8,9,10], 
which systematically eliminates higher order trends and reveals the correla- 
tions present in the highly non-stationary data. Our analysis shows that (i) 
the persistence, characterised by the correlation C{s) of the volatility series 
separated by s days, follows a power law, C{s) ~ s~'^, with roughly the same 
exponent 7 = 0.8 for all stations considered, and that (ii) the range of this 
universal persistence law seems exceed one decade. 



2. Methodology of Scaling analysis 

We have performed the scahng analysis of volatility series for the records of 
the maximum daily temperature Tj of the following weather stations: Prague 
(218 yr), Melbourne (136 yr), Luling (90 yr), Seoul (86 yr), Kasan (96 yr), 
Vancouver (93 yr), Tashkent (97 yr). New York city (116 yr), Brookings (99 
yr) and St. Petersburg (111 yr). The numbers within (.) are the length of 
the records. Prom the daily maximum temperature series, we construct the 
volatility series K;=|Tj+i — Ti|. Then we remove the chmatological annual cycle 
[11] from Vi to obtain AVi = Vi-<Vi>. 

Qualitatively, persistence is clearly seen (patches of high and low volatihty) 
in the plot of 1^ as shown in Pig. la for one year in Prague. Pigure lb shows 
the volatility series AVi around the mean. Pigures le and If show the plot 
of Vi and AVi respectively, obtained from the phase randomised surrogate 
data (preserving the distribution) [12] of the temperature increment series 
(Tj_|_i — Tj). Quantitatively, persistence in AVi can be characterised by the 
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(auto) correlation function, 

1 N-s 

C{s) = {AVi AVi+s) = -^r^ E (1) 

where N is the number of days in the record. A direct calculation of C{s) 
is hindered by the level of noise present in finite temperature series, and by 
possible nonstationarities in the data. Following [13,14], we do not calculate 
C{s) directly, rather we study the fluctuations in the volatility profile Yn = 
Z^iLi divide the profile into non overlapping time windows of length 

s and determine the squared fluctuations of the proflle (as specified below) 
in each segment. The mean square fiuctuations, averaged over all segments 
of length s, are related to the correlation function C{s) (see below). For this 
study we employ a hierarchy of methods that differ in the way the fluctuations 
are measured and possible nonstationarities are eliminated (see e.g. ,[10, 15] for 
detailed description of the methods) : 

(i) In the simple fluctuation analysis (FA), we calculate the difference of the 
proflle at both ends of each segment. The square of this difference represents 
the square of the fluctuations in each segment. 

(ii) In the first order detrended fluctuation analysis, we determine in each 
segment the best linear flt of the proflle. The variance of the proflle from this 
straight line represents the square of the fluctuations in each segment. 

(iii) In general, in the n-th order DFA we determine in each segment the 
best n-th order polynomial flt of the proflle. The variance of the proflle from 

these best n-th order polynomials represents the square of the fluctuations 
in each segment. The fluctuation function F{s) is the root mean square of 
the fluctuations in all segments. For the relevant case of long-term power-law 
correlations, C{s) ~ s~^, with < 7 < 1, the fluctuation function increases 
with s according to a power law [16] , 

F(s) ~ s^ « = 1 - 1- (2) 

For uncorrelated as well as short range correlated data, we have a — j. For 
long range correlated data we have a > |. 

By deflnition, FA does not eliminate trends similar to the Hurst method and 
the conventional power spectral methods [17]. In contrast, DFAn eliminates 

trends of order n in the proflle and n — 1 in the original time series. Thus, from 
the comparison of fluctuation functions F{s) obtained from different methods 
one can learn about long term correlations and types of trends, which cannot 
be achieved by the conventional techniques. 
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Fig. 1. Temperature volatility series for Prague (ly), before removing the (climato- 
logical) annual cycle, (b) Same data as in (a) but after removing the annual cycle. 

(c) Profile function of the volatility series, (d) Fluctuation functions F[s) obtained 
from the profile function shown in (c). Curves from top to bottom represent F{s) 
obtained from FA (A) and DFAl-5 (o). Panels (e)-(h) represent the same quantities 
as in (a)-(d) for phase randomised surrogate data (preserving the distribution) of 
the temperature increment series (Tj+i — Tj). Solid lines at the bottom of the panels 

(d) and (h) are lines with slopes 0.6 and 0.5 respectively. The unit of temperature 
volatility Vi and its mean AV^ is °C/(i. Scale of F{s) shown in (d) and (h) are 
arbitrary and the unit of s, n and d is a day. 



3. Scaling analysis of Temperature volatility series 



We begin the analysis with the volatility series AVj for Prague which is the 
longest series (218 yr) in this study. Figure Ic shows the profile, and Fig. Id 
shows the fluctuation functions obtained from FA (A) and DFAl-5 (o). In the 
log-log plot, all curves are approximately straight lines with a slope of a close 
to 0.6. This result suggests that there exists a long term persistence expressed 
by the power law decay of correlation exponent 7 = 0.8. There is slight upward 
bend in the F{s) curve obtained from FA. This shows that there is a trend in 
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Fig. 2. Fluctuation functions F{s) obtained from temperature volatility series for 
four different continental sites: Tashkent (Asia), Melbourne (Australia), New York 
(US) and Luling (US). Top two curves in each panel represent the F{s) obtained 
from FA (a) and DFA2 (•) of the volatility series. The third and fourth curves in 
each panel represent the F[s) obtained from FA (A) and DFA2 (o) of volatility 
series derived from the surrogate data. Scale of F{s) is arbitrary and the unit of 
s is a day. Solid lines shown at the top of each panel is the line with slope 0.1 for 
comparison. Note that we plot F(s)/s°-^ so that a slope of 0.1 corresponds to 0.6. 

the volatility series, as observed in [2] for the temperature data. Hence, the 
effect of the city growth in Prague does not lead only to an increase in the mean 
temperature of the city, but also to an increase in the temperature volatility. 
This trend is not removed in FA while DFAl and subsequently higher orders 
of DFA have removed it. Figure Ig shows the profile of the volatility series 
obtained from the surrogate data of the temperature increment series. When 
compared with Fig. Ic, profile shows much less fluctuations. Figure Ih shows 
the fluctuation functions F{s) obtained from FA (A) and DFAl-5 (o) for the 
profile shown in Fig. Ig. In the log-log plot, the fluctuation functions F{s) are 
straight fines with a slope of a ~ \. This shows that the original data have 
nonlinear features [18,19]. 

Figures 2(a-d) sfiow tfie results of volatility analysis for four different conti- 
nental sites: Tasfikent from Asia (97 yr). New York city (116 yr), Melbourne 
(136 yr) and Luling from Texas (90 yr). Tfie top two curves in eacfi panel are 



5 



the fluctuation curves obtained from FA (A) and DFA 2 (•) for volatility se- 
ries of temperature data while the third and fourth curves are those obtained 
from the same analyses (FA and DFA2) of volatility series derived from the 
surrogate data. For the sake of clarity, we have divided the fluctuation func- 
tion F{s) by The straight line shown in each panel has a slope 0.1. The 
scaled fluctuation functions F(s)/s^'^ obtained from the original series AVi 
(the first and second curves in all panels shown in Fig. 2) are almost parallel 
to the line with slope 0.1. The fluctuation functions F[s)/s^'^ obtained from 
the surrogate data (third and fourth curves in all panels shown in Fig. 2) are 
all parallel to x-axis. 

We obtained similar results with exponents between 0.58 and 0.63 for all 10 
sites analysed. Thus, there exist a quite general persistence law in the volatil- 
ity series obtained from temperature data. The existence of such a general 
persistence law in the temperature volatility series indicates that the change 
in the temperature in different climatic zones may be governed by the same 
basic principles, leading to similar fluctuations in volatility series in different 
places. 



4 Testing the volatility scaling performance of simulated tempera- 
ture records 

We use this result to test the state-of-the art global climate models. In an 
earlier work, Govindan et al.[20], have shown that temperature data simu- 
lated by GCMs violate the observed scaling behavior [2]. Here we consider 
the scaling analysis of the temperature volatility time series. We concen- 
trate on four models, for which data for all three scenarios viz. Control run 
(CR), (ii) greenhouse gas forcing only (GHGF) and (iii) greenhouse gas plus 
aerosol forcing (GHGPS), are available for the same simulation period. The 
GCMs are: (i) CSIR0-Mk2 (Melbourne), (ii) ECHAM4/OPYC3 (Hamburg), 
(iii) CGCMl (Victoria, Canada) and (iv) CCSR/NIES (Tokyo) (see [21] for 
details) . 

In CR, the CO2 content is flxed. In GHGF scenario, one mainly considers the 
effect of greenhouse gases. The amount of greenhouse gas forcing is taken from 
the historic data until 1990 and then increased at a rate of 1% per year. In 
GHGPS scenario, the effect of aerosols (mainly sulphates) in the atmosphere 
is taken into account which can mitigate and partially offset the greenhouse 
warming. Although this scenario represents an important step towards com- 
prehensive climate simulation, the precise role of aerosols in the mechanism 
of climate modelling is still unclear. 

The temperature data simulated by these four models for three different sce- 
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narios arc available from the IPCC Data Distribution Center [21]. We ex- 
tracted data for nine representative sites around the globe (Prague, Mel- 
bourne, Seoul, Vancouver, Kasan, Luling, Tashkent, New York and St. Pe- 
tersburg). For each model and each scenario, we selected the temperature 
records of the four grid points closest to each site, and bilinearly interpolated 
the data to the location of the site. 

We generate volatility series from the monthly temperature series by the same 
procedure (explained earlier) for daily temperature data. We removed the 
seasonal periodic trends in Tj (before constructing the increment series) and 
also in the volatility series Vi. Since the GCMs simulate temperature data in 
monthly time scales, for comparison, we averaged the daily temperature (ob- 
served) data (of the nine climatological stations) to monthly data. We present 
our results of scaling analysis of volatility series obtained from temperature 
data simulated by models, for three different scenarios and also for the ob- 
served data in the form of histograms e.g. Fig. 3. 

The scaling exponents a obtained for the observed data are shown in the 
topmost panel (for comparison they are plotted repeatedly for three times 
corresponding to three different scenarios). We can see immediately that all 
sites occupy the bin corresponding to the a value in the range of 0.58 to 0.62 
(see Fig. 3 top panels) indicating the universal behavior as found in the daily 
data. The grey color boxes represent the a values of the volatility series derived 
from surrogate series. For the case of monthly data the a values obtained from 
the volatility series of surrogate data exhibit correlated behavior for some of 
the sites. However, they arc well below the values of the original data. When we 
compare the results obtained from GCMs, none of the models show a unique 
behavior as found in the observed data. The a values arc distributed widely for 
all the models. The comparison of the exponents of volatility series obtained 
from original data and that of surrogate data shows their difference is close 
to zero and even zero for some of the sites. For instance, in CSIRO model, for 
each of the three different scenarios the exponent values of the volatility scries 
of the original data and that derived from surrogate series, have same value 
for St. Petersburg(9). Likewise, in ECHAM4, similar conclusion can be drawn 
for Kasan(5) as well as Luling(6). However, the distribution of a values of 
volatility series obtained from surrogate data, towards higher values is clearly 
seen for the well established scenarios like GHGF and GHGPS. 

It follows from our analysis that there is a universal persistence in the volatility 
series obtained from temperature increment series with an exponent of 0.60 ± 
0.03. When we use this result to test the scaling performance of the virtual 
climate records simulated by GCMs, we find that (i) models data display wide 
range of exponent values, (ii) surrogate analysis suggests that models data 
lack nonlinearity, especially for the well established scenarios, for some of the 
sites considered here. 
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Fig. 3. Histogram of DFA exponent a for volatility series obtained from monthly 
temperature data for nine different sites: Prague(l), MeIbourne(2), Seoul(3), Van- 
couver(4), Kasan(5), Luling(6), Tashkent(7), New York(8) and St. Petersburg(9). 
Top three panels represent observed data. The other panels represent the exponents 
of volatility series obtained from temperature time series simulated by four dif- 
ferent climate models (CSIR0-Mk2, ECHAM4/OPYC3, CGCMl and CCSR) for 
three different scenarios, namely control run (CR), Greenhouse gas only forcing 
(GHGF) and Greenhouse gas plus sulphate forcing (GHGPS). Open boxes repre- 
sent the exponents obtained for volatility series of original data while grey boxes 
represent the exponents obtained for volatility series derived from surrogate data of 
the corresponding temperature increment series. 
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